175 research outputs found

    Translocation and deletion breakpoints in cancer genomes are associated with potential non-B DNA-forming sequences

    Gross chromosomal rearrangements (including translocations, deletions, insertions and duplications) are a hallmark of cancer genomes and often create oncogenic fusion genes. An obligate step in the generation of such gross rearrangements is the formation of DNA double-strand breaks (DSBs). Since the genomic distribution of rearrangement breakpoints is non-random, intrinsic cellular factors may predispose certain genomic regions to breakage. Notably, certain DNA sequences with the potential to fold into secondary structures [potential non-B DNA structures (PONDS); e.g. triplexes, quadruplexes, hairpin/cruciforms, Z-DNA and single-stranded looped-out structures with implications in DNA replication and transcription] can stimulate the formation of DNA DSBs. Here, we tested the postulate that these DNA sequences might be found at, or in close proximity to, rearrangement breakpoints. By analyzing the distribution of PONDS-forming sequences within ±500 bases of 19 947 translocation and 46 365 sequence-characterized deletion breakpoints in cancer genomes, we find significant association between PONDS-forming repeats and cancer breakpoints. Specifically, (AT)n, (GAA)n and (GAAA)n constitute the most frequent repeats at translocation breakpoints, whereas A-tracts occur preferentially at deletion breakpoints. Translocation breakpoints near PONDS-forming repeats also recur in different individuals and patient tumor samples. Hence, PONDS-forming sequences represent an intrinsic risk factor for genomic rearrangements in cancer genomes

    Detection and characterization of local inverted repeats regularities

    To explore inverted-repeat regularities along genome sequences, we propose a sliding window method to extract the concentration scores of periodic inverted-repeat regularities and the total mass of possible inverted-repeat pairs. We apply the method to the human genome and locate the regions with the potential to form a large number of hairpin/cruciform structures. The number of windows found with periodic regularities is small, and the patterns of occurrence are chromosome specific.
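The abstract above does not define its concentration score or "total mass", so the following is only a minimal sketch of the general sliding-window idea, under stated assumptions. It counts, per window, pairs of k-mers in which the downstream k-mer is the reverse complement of the upstream one. The function names, the k-mer length `k`, and the window and step sizes are all hypothetical choices for illustration, not the authors' method.

```python
# Simplified stand-in for a sliding-window inverted-repeat scan.
# Assumption: an "inverted repeat pair" is a k-mer followed, without overlap,
# by its reverse complement inside the same window.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(kmer):
    """Reverse complement of a k-mer made only of A, C, G, T."""
    return "".join(COMPLEMENT[base] for base in reversed(kmer))


def count_inverted_pairs(window, k=4):
    """Count pairs (i, j) with j >= i + k where window[j:j+k] is the
    reverse complement of window[i:i+k]."""
    positions = {}
    for i in range(len(window) - k + 1):
        kmer = window[i:i + k]
        if set(kmer) <= set("ACGT"):  # skip k-mers with N or other symbols
            positions.setdefault(kmer, []).append(i)
    count = 0
    for kmer, starts in positions.items():
        partner_starts = positions.get(reverse_complement(kmer), [])
        for i in starts:
            count += sum(1 for j in partner_starts if j >= i + k)
    return count


def sliding_window_counts(sequence, window_size=50, step=25, k=4):
    """Return (window_start, pair_count) for each window along the sequence."""
    seq = sequence.upper()
    last_start = max(len(seq) - window_size, 0)
    return [
        (start, count_inverted_pairs(seq[start:start + window_size], k))
        for start in range(0, last_start + 1, step)
    ]
```

For example, `count_inverted_pairs("GAATTTTTATTC", k=4)` finds the single pair formed by `GAAT` at position 0 and its reverse complement `ATTC` at position 8, the kind of arrangement that could fold into a hairpin stem.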

    Local DNA dynamics shape mutational patterns of mononucleotide repeats in human genomes

    Single base substitutions (SBSs) and insertions/deletions are critical for generating population diversity and can lead both to inherited disease and cancer. Whereas on a genome-wide scale SBSs are influenced by cellular factors, on a fine scale SBSs are influenced by the local DNA sequence-context, although the role of flanking sequence is often unclear. Herein, we used bioinformatics, molecular dynamics and hybrid quantum mechanics/molecular mechanics to analyze sequence context-dependent mutagenesis at mononucleotide repeats (A-tracts and G-tracts) in human population variation and in cancer genomes. SBSs and insertions/deletions occur predominantly at the first and last base-pairs of A-tracts, whereas they are concentrated at the second and third base-pairs in G-tracts. These positions correspond to the most flexible sites along A-tracts, and to sites where a ‘hole’, generated by the loss of an electron through oxidation, is most likely to be localized in G-tracts. For A-tracts, most SBSs occur in the direction of the base-pair flanking the tracts. We conclude that intrinsic features of local DNA structure, i.e. base-pair flexibility and charge transfer, render specific nucleotides along mononucleotide runs susceptible to base modification, which then yields mutations. Thus, local DNA dynamics contributes to phenotypic variation and disease in the human population

    Genome-Wide Analyses of Recombination Prone Regions Predict Role of DNA Structural Motif in Recombination

    HapMap findings reveal a surprisingly asymmetric distribution of recombinogenic regions. Short recombinogenic regions (hotspots) are interspersed between large, relatively non-recombinogenic regions. This raises the interesting possibility of DNA sequence and/or other cis-elements as determinants of recombination. We hypothesized the involvement of non-canonical sequences that can result in local non-B DNA structures and tested this using the G-quadruplex DNA as a model. G-quadruplex or G4 DNA is a unique form of four-stranded non-B DNA structure that engages certain G-rich sequences; the presence of such motifs has been noted within telomeres. In support of this hypothesis, genome-wide computational analyses presented here reveal enrichment of potential G4 (PG4) DNA forming sequences within 25618 human hotspots relative to 9290 coldspots (p<0.0001). Furthermore, co-occurrence of PG4 DNA within several short sequence elements that are associated with recombinogenic regions was found to be significantly more than randomly expected. Interestingly, analyses of more than 50 DNA binding factors revealed that co-occurrence of PG4 DNA with target DNA binding sites of transcription factors c-Rel, NF-kappa B (p50 and p65) and Evi-1 was significantly enriched in recombination-prone regions. These observations support involvement of G4 DNA in recombination, predicting a functional model that is consistent with duplex-strand separation induced by formation of G4 motifs in supercoiled DNA and/or when assisted by other cellular factors
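The abstract above does not state which sequence rule was used to call potential G4 (PG4) motifs. As a hedged illustration only, the sketch below assumes a widely used folding rule: four runs of at least three guanines separated by loops of 1 to 7 bases. The pattern, the names `PG4_PATTERN` and `find_pg4`, and the loop limits are assumptions for this example, not details taken from the study.

```python
import re

# Assumed motif rule (not from the abstract): G{3+} N{1-7} G{3+} N{1-7} G{3+} N{1-7} G{3+}
PG4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")


def find_pg4(sequence):
    """Return (start, end, motif) for each non-overlapping match on the given strand.

    Only the supplied strand is scanned; motifs on the complementary strand
    would appear here as C-rich runs and need a separate pass.
    """
    seq = sequence.upper()
    return [(m.start(), m.end(), m.group()) for m in PG4_PATTERN.finditer(seq)]
```

Running `find_pg4` over sets of hotspot and coldspot sequences and comparing the motif counts would be one simple way to look for the kind of enrichment the abstract reports, although the statistical test used there is not specified.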

    Distinct sequence features underlie microdeletions and gross deletions in the human genome

    Microdeletions and gross deletions are important causes (~20%) of human inherited disease and their genomic locations are strongly influenced by the local DNA sequence environment. This notwithstanding, no study has systematically examined their underlying generative mechanisms. Here, we obtained 42,098 pathogenic microdeletions and gross deletions from the Human Gene Mutation Database (HGMD) that together form a continuum of germline deletions ranging in size from 1bp to 28,394,429bp. We analyzed the DNA sequence within 1-kb of the breakpoint junctions and found that the frequencies of non-B DNA-forming repeats, GC-content, and the presence of seven of 78 specific sequence motifs in the vicinity of pathogenic deletions correlated with deletion length for deletions of length ≤30 bp. Further, we found that the presence of DR, GQ and STR repeats is important for the formation of longer deletions (>30 bp) but not for the formation of shorter deletions (≤30 bp) whilst significantly (Chi-square test P-value30 bp). We provide evidence to support a functional distinction between microdeletions and gross deletions. Finally, we propose that a deletion length cut-off of 25-30bp may serve as an objective means to functionally distinguish microdeletions from gross deletions

    The Role of Methylation in the Intrinsic Dynamics of B- and Z-DNA

    Methylation of cytosine at the 5-carbon position (5mC) is observed in both prokaryotes and eukaryotes. In humans, DNA methylation at CpG sites plays an important role in gene regulation and has been implicated in development, gene silencing, and cancer. In addition, the CpG dinucleotide is a known hot spot for pathologic mutations genome-wide. CpG tracts may adopt left-handed Z-DNA conformations, which have also been implicated in gene regulation and genomic instability. Methylation facilitates this B-Z transition but the underlying mechanism remains unclear. Herein, four structural models of the dinucleotide d(GC)5 repeat sequence in B-, methylated B-, Z-, and methylated Z-DNA forms were constructed and an aggregate 100 nanoseconds of molecular dynamics simulations in explicit solvent under physiological conditions was performed for each model. Both unmethylated and methylated B-DNA were found to be more flexible than Z-DNA. However, methylation significantly destabilized the BII, relative to the BI, state through the Gp5mC steps. In addition, methylation decreased the free energy difference between B- and Z-DNA. Comparisons of α/γ backbone torsional angles showed that torsional states changed marginally upon methylation for B-DNA, and Z-DNA. Methylation-induced conformational changes and lower energy differences may contribute to the transition to Z-DNA by methylated, over unmethylated, B-DNA and may be a contributing factor to biological function

    DNA models of trinucleotide frameshift deletions: the formation of loops and bulges at the primer–template junction

    Although mechanisms of single-nucleotide residue deletion have been investigated, processes involved in the loss of longer nucleotide sequences during DNA replication are poorly understood. Previous reports have shown that in vitro replication of a 3′-TGC TGC template sequence can result in the deletion of one 3′-TGC. We have used low-energy circular dichroism (CD) and fluorescence spectroscopy to investigate the conformations and stabilities of DNA models of the replication intermediates that may be implicated in this frameshift. Pyrrolocytosine or 2-aminopurine residues, site-specifically substituted for cytosine or adenine in the vicinity of extruded base sequences, were used as spectroscopic probes to examine local DNA conformations. An equilibrium mixture of four hybridization conformations was observed when template bases looped-out as a bulge, i.e. a structure flanked on both sides by duplex DNA. In contrast, a single-loop structure with an unusual unstacked DNA conformation at its downstream edge was observed when the extruded bases were positioned at the primer–template junction, showing that misalignments can be modified by neighboring DNA secondary structure. These results must be taken into account in considering the genetic and biochemical mechanisms of frameshift mutagenesis in polymerase-driven DNA replication

    Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes

    Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov

    Controlled Chaos of Polymorphic Mucins in a Metazoan Parasite (Schistosoma mansoni) Interacting with Its Invertebrate Host (Biomphalaria glabrata)

    Invertebrates were long thought to possess only a simple, effective and hence non-adaptive defence system against microbial and parasitic attacks. However, recent studies have shown that invertebrate immunity also relies on immune receptors that diversify (e.g. in echinoderms, insects and mollusks (Biomphalaria glabrata)). Apparently, individual or population-based polymorphism-generating mechanisms exist that permit the survival of invertebrate species exposed to parasites. Consequently, the generally accepted arms race hypothesis predicts that molecular diversity and polymorphism also exist in parasites of invertebrates. We investigated the diversity and polymorphism of parasite molecules (Schistosoma mansoni Polymorphic Mucins, SmPoMucs) that are key factors for the compatibility of schistosomes interacting with their host, the mollusc Biomphalaria glabrata. We have elucidated the complex cascade of mechanisms acting both at the genomic level and during expression that confer polymorphism to SmPoMuc. We show that SmPoMuc is coded by a multi-gene family whose members frequently recombine. We show that these genes are transcribed in an individual-specific manner, and that for each gene, multiple splice variants exist. Finally, we reveal the impact of this polymorphism on the SmPoMuc glycosylation status. Our data support the view that S. mansoni has evolved a complex hierarchical system that efficiently generates a high degree of polymorphism, a “controlled chaos”, based on a relatively low number of genes. This contrasts with protozoan parasites that generate antigenic variation from large sets of genes such as Trypanosoma cruzi, Trypanosoma brucei and Plasmodium falciparum. Our data support the view that the interactions between parasites and their invertebrate hosts are far more complex than previously thought. While most studies in this matter have focused on invertebrate host diversification, we clearly show that diversifying mechanisms also exist on the parasite side of the interaction. Our findings shed new light on how and why invertebrate immunity develops